Data Quality Measurement With Threshold Using Genetic Algorithm
نویسنده
چکیده
Our basic idea is to employ association rule for the purpose of data quality measurement. Strong rule generation is an important area of data mining. We purpose a Genetic Algorithm to generate high quality Association Rules with four metrics they are confidence, completeness, interestingness and comprehensibility. These metrics are combined as an objective fitness function. Fitness function evaluates the quality of each rule. The advantage of using genetic algorithm is to discover high level prediction rules is that they perform a global search and cope better with attribute interaction than the greedy rule induction algorithm often used in data mining. Association Rule Mining is one of the most applicable techniques in data mining. Association rules that satisfy threshold specified by the user are referred to as strong association rules and a considered interesting.
منابع مشابه
Comparison of Linear and Threshold Models for Estimation Genetic and Phenotypic Parameters of Success of Conception at First Service and Inseminations to Conception in Holstein Cattles in East Azarbayjan Province
In this research genetic and phenotypic parameters were estimated using linear and threshold models, for reproductive traits, data from 6 large industrial dairy herd of East Azerbaijan province collected by Agriculture Jihad Organization during 10 years (2001-2010). Best linear unbiased predictions of traits breeding values were estimated using Restricted Maximum Likelihood method by WOMBAT sof...
متن کاملComparison of Linear and Threshold Models for Estimation Genetic and Phenotypic Parameters of Success of Conception at First Service and Inseminations to Conception in Holstein Cattles in East Azarbayjan Province
In this research genetic and phenotypic parameters were estimated using linear and threshold models, for reproductive traits, data from 6 large industrial dairy herd of East Azerbaijan province collected by Agriculture Jihad Organization during 10 years (2001-2010). Best linear unbiased predictions of traits breeding values were estimated using Restricted Maximum Likelihood method by WOMBAT sof...
متن کاملModeling of measurement error in refractive index determination of fuel cell using neural network and genetic algorithm
Abstract: In this paper, a method for determination of refractive index in membrane of fuel cell on basis of three-longitudinal-mode laser heterodyne interferometer is presented. The optical path difference between the target and reference paths is fixed and phase shift is then calculated in terms of refractive index shift. The measurement accuracy of this system is limited by nonlinearity erro...
متن کاملIdentifying Significant Health Measurement of Equipment Affecting the Quality of a Continuous Product (Case Study: Unit 2, Parand Gas Turbine Power Plant)
Objective: Majorproducers consider quality as a major criterion in decision making.Quality characteristics are affected by maintenance and repair decisions. In this study, a model is developed to determine significant measurements of production equipment affecting the quality of a continuous product to identify which measurements are more critical in terms of quality. Methods: Diversity of par...
متن کاملCombining Neural Network with Genetic Algorithm for prediction of S4 Parameter using GPS measurement
The ionospheric plasma bubbles cause unpredictable changes in the ionospheric electron density. These variations in the ionospheric layer can cause a phenomenon known as the ionospheric scintillation. Ionospheric scintillation could affect the phase and amplitude of the radio signals traveling through this medium. This phenomenon occurs frequently around the magnetic equator and in low latitu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012